Hmm Composition-based Rapid Model Ad Gmm Adaptation Evaluation O

نویسنده

  • Masaki Ida
چکیده

When a speech recognition system is used in a real environment, its recognition performance is affected by the surrounding noise. Most types of additional noise as well as SNRs are difficult to predict, so there is a mismatch between the training and test data. We need a method to deal with this problem. In this paper, we propose an HMM compositionbased model adaptation method with a priori noise GMM adaptation against the mismatch between different types of noise in noisy data. We also prepare multiple HMMs for several SNRs and select the one that can most effectively, based on the acoustic likelihood, deal with unknown SNRs. We carried out speech recognition experiments in noisy environments by using an AURORA2 task test set B. The results show 53% improvement in word accuracy from the baseline system with one-second real noise data used for adaptation. The performance is equivalent that of conventional HMM composition methods using ten-second real data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker adaptation of context dependent deep neural networks based on MAP-adaptation and GMM-derived feature processing

In this paper we propose a novel speaker adaptation method for a context-dependent deep neural network HMM (CD-DNNHMM) acoustic model. The approach is based on using GMMderived features as the input to the DNN. The described technique of processing features for DNNs makes it possible to use GMM-HMM adaptation algorithms in the neural network framework. Adaptation to a new speaker can be simply ...

متن کامل

A scalable approach to using DNN-derived features in GMM-HMM based acoustic modeling for LVCSR

We present a new scalable approach to using deep neural network (DNN) derived features in Gaussian mixture density hidden Markov model (GMM-HMM) based acoustic modeling for large vocabulary continuous speech recognition (LVCSR). The DNN-based feature extractor is trained from a subset of training data to mitigate the scalability issue of DNN training, while GMM-HMMs are trained by using state-o...

متن کامل

GMM-derived features for effective unsupervised adaptation of deep neural network acoustic models

In this paper we investigate GMM-derived features recently introduced for adaptation of context-dependent deep neural network HMM (CD-DNN-HMM) acoustic models. We improve the previously proposed adaptation algorithm by applying the concept of speaker adaptive training (SAT) to DNNs built on GMM-derived features and by using fMLLR-adapted features for training an auxiliary GMM model. Traditional...

متن کامل

Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments

This paper describes a multi-template unsupervised speaker adaptation based on HMM-Sufficient Statistics. Multiple class-dependent models based on gender and age are used to push up the adaptation performance while keeping adaptation time within few seconds with just one arbitrary utterance. Adaptation begins with the estimation of speaker‘s class from the N-best neighbor speakers using Gaussia...

متن کامل

Speaker state recognition using an HMM-based feature extraction method

In this article we present an efficient approach to modeling the acoustic features for the tasks of recognizing various paralinguistic henomena. Instead of the standard scheme of adapting the Universal Background Model (UBM), represented by the Gaussian ixture Model (GMM), normally used to model the frame-level acoustic features, we propose to represent the UBM by building monophone-based Hidde...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002